Advanced Information Retrieval Using XML Standards

Authors

  • Ralf Schweiger
  • Simon Hölzer
  • Joachim Dudeck
Abstract

The bulk of clinical data is available in electronic form. About 80% of this electronic data, however, is narrative text and is therefore of limited use for machine interpretation. As a result, the discussion has shifted from "electronic versus paper-based data" towards "structured versus unstructured electronic data". Today's XML technology paves the way towards more structured clinical data, and several XML-based standards, such as the Clinical Document Architecture (CDA), are emerging. Implementing XML-based applications nevertheless remains a challenge. This paper focuses on XML retrieval issues and describes the difficulties and prospects of such an approach. The result of our work is a search technique called "topic matching" that exploits structured data to provide a search quality superior to that of established text matching methods. With this solution we are able to utilize large numbers of heterogeneously structured documents with only a minimum of effort.

Similar resources

Prototyping a Vibrato-Aware Query-By-Humming (QBH) Music Information Retrieval System for Mobile Communication Devices: Case of Chromatic Harmonica

Background and Aim: The current research aims at prototyping query-by-humming music information retrieval systems for smart phones. Methods: This multi-method research follows simulation technique from mixed models of the operations research methodology, and the documentary research method, simultaneously. Two chromatic harmonica albums comprised the research population. To achieve the purpose ...

Statistical Language Models for Intelligent XML Retrieval

The XML standards that are currently emerging have a number of characteristics that can also be found in database management systems, like schemas (DTDs and XML schema) and query languages (XPath and XQuery). Following this line of reasoning, an XML database might resemble traditional database systems. However, XML is more than a language to mark up data; it is also a language to mark up textua...

Efficient Evaluation of Partial Match Queries for XML Documents Using Information Retrieval Techniques

Young-Ho Park (1), Kyu-Young Whang (1), Byung Suk Lee (2), and Wook-Shin Han (3). (1) Department of Computer Science and Advanced Information Technology Research Center (AITrc), Korea Advanced Institute of Science and Technology (KAIST), Korea. (2) Department of Computer Science, University of Vermont, Burlington, VT, USA ...

Measuring Similarity between XML Documents

With the advance of World Wide Web standards, XML documents become popular in e-business applications for information exchange. Electronic catalogs and transaction records are now formatted in XML. XML documents are semi-structured documents with XML schemas marking up the semantics. XML separates presentation from semantics so that presentation of information on different devices can be proces...

Mining Literary Texts by Using Domain Ontologies

This paper describes a query system on texts and literary material with advanced information retrieval tools. As a test bed we chose the electronic version of Dante’s Inferno, manually tagged using XML, enriched with a domain ontology describing the historical, social and cultural context represented as a separate XML document.

Journal title:
  • Studies in health technology and informatics

Volume 116

Publication date: 2005